CDS

Accession Number TCMCG022C36668
gbkey CDS
Protein Id XP_039166021.1
Location join(8900640..8900836,8901007..8901128,8901274..8901389,8901865..8901938,8902075..8902339,8902457..8902588,8902798..8902962)
Gene LOC104436415
GeneID 104436415
Organism Eucalyptus grandis

Protein

Length 356aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA698663
db_source XM_039310087.1
Definition thiol protease aleurain [Eucalyptus grandis]

EGGNOG-MAPPER Annotation

COG_category O
Description Belongs to the peptidase C1 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01002        [VIEW IN KEGG]
KEGG_ko ko:K01366        [VIEW IN KEGG]
EC 3.4.22.16        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko04142        [VIEW IN KEGG]
ko04210        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
map04210        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTCGCGCGAGGCTCCTGTGCTCCGCCGTCCTCCTCCTCGTCGCCGTCGCCGTCTCCGCCGCGGCGTCGAGCTTCGAGGAGTCCAACCCCATCCGGCTCTTCCCCGACGGCGGCCTCCGCGACCTCGAGTCCTCCATCGTCCAGATCGTCGGCCGCACCCGCCACGCCTTCTCCTTCGCCCGCTTCGCCAACAGGTATGGGAAGAGGTACGAGACCGCGGAGGAGATCAAGCTGCGGTTCGAGATCTTCAGGGAGAATCTCAAGTTGATCCGATCCACCAACAAGAAGGGCTTGCCCTACACCCTCGGTGTCAATAAGTTTGCTGATTGGAGCTGGGAGGAGTTCAGGAGGCACAGACTGGGAGCTGCTCAAAACTGCTCTGCCACCACCAAGGGCAACCACAAGCTCACCGACGAAGCTCTTCCCGAGATGAAAGACTGGAGAGAAAAGGGCATTGTAAGCCCAATTAAAGATCAGGGGCACTGTGGATCTTGCTGGACTTTCAGTACCACTGGAGCTCTTGAGGCTGCTTATCACCAAGCATTCGGGAAACAAATCTCTCTGTCTGAGCAGCAGCTTGTGGACTGCGCTGGGGCTTTCAACAACTTTGGATGTAGTGGTGGACTGCCATCCCAAGCCTTTGAGTACGTCAAGTACAACGGTGGCCTTGATACCGAGGAAGCATATCCTTATACCGCAGTGGATGGTAGCTGCAAATTCTCGGCTGATAATGTTGGTGTCCAAGTGCTCGACTCTGTTAACATCACCTTGGGTGCTGAGGATGAACTAAAGCATGCAGTTGCCTTCGTCCGGCCAGTGAGTGTGGCATTCCAGGTCGTGAAAGACTTCAGATTGTACAAGTCGGGTGTCTACACGAGCGATACATGCGGTAGCACTTCCATGGATGTGAACCATGCTGTTCTCGCTGTTGGTTATGGAGTTGAAGATGGTGTTCCGTTCTGGCTCATCAAGAATTCCTGGGGAGCAGACTGGGGTGACCACGGATACTTCAAGATGGAGATGGGAAAGAACATGTGTGTAAGTCCCCTCCTATATAAAATTCTTTGA
Protein:  
MARARLLCSAVLLLVAVAVSAAASSFEESNPIRLFPDGGLRDLESSIVQIVGRTRHAFSFARFANRYGKRYETAEEIKLRFEIFRENLKLIRSTNKKGLPYTLGVNKFADWSWEEFRRHRLGAAQNCSATTKGNHKLTDEALPEMKDWREKGIVSPIKDQGHCGSCWTFSTTGALEAAYHQAFGKQISLSEQQLVDCAGAFNNFGCSGGLPSQAFEYVKYNGGLDTEEAYPYTAVDGSCKFSADNVGVQVLDSVNITLGAEDELKHAVAFVRPVSVAFQVVKDFRLYKSGVYTSDTCGSTSMDVNHAVLAVGYGVEDGVPFWLIKNSWGADWGDHGYFKMEMGKNMCVSPLLYKIL